Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 31648 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.6 MiB |
| Average record size in memory | 152.0 B |
Variable types
| NUM | 14 |
|---|---|
| CAT | 5 |
Reproduction
| Analysis started | 2020-05-31 13:05:25.112592 |
|---|---|
| Analysis finished | 2020-05-31 13:05:54.473255 |
| Duration | 29.36 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
country has a high cardinality: 78 distinct values | High cardinality |
state has a high cardinality: 69 distinct values | High cardinality |
city has a high cardinality: 5905 distinct values | High cardinality |
10k is highly correlated with 5k and 9 other fields | High correlation |
5k is highly correlated with 10k and 8 other fields | High correlation |
20k is highly correlated with 5k and 9 other fields | High correlation |
half is highly correlated with 5k and 9 other fields | High correlation |
25k is highly correlated with 5k and 9 other fields | High correlation |
30k is highly correlated with 5k and 9 other fields | High correlation |
35k is highly correlated with 5k and 9 other fields | High correlation |
40k is highly correlated with 5k and 9 other fields | High correlation |
official is highly correlated with 5k and 9 other fields | High correlation |
pace is highly correlated with 5k and 9 other fields | High correlation |
overall is highly correlated with 10k and 9 other fields | High correlation |
genderdiv is highly correlated with overall | High correlation |
bib has unique values | Unique |
| Distinct count | 1433 |
|---|---|
| Unique (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22636710970294452 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0000000000000002 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1525252525 |
| Q1 | 0.1877525253 |
| median | 0.2184343434 |
| Q3 | 0.2607323232 |
| 95-th percentile | 0.3207070707 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.07297979798 |
Descriptive statistics
| Standard deviation | 0.05322533827 |
|---|---|
| Coefficient of variation (CV) | 0.2351284086 |
| Kurtosis | 1.771341124 |
| Mean | 0.2263671097 |
| Median Absolute Deviation (MAD) | 0.03535353535 |
| Skewness | 0.5653030277 |
| Sum | 7164.066288 |
| Variance | 0.002832936634 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.2042929293 | 81 | 0.3% | |
| 0.2036616162 | 79 | 0.2% | |
| 0.1986111111 | 78 | 0.2% | |
| 0.2061868687 | 75 | 0.2% | |
| 0.2111111111 | 74 | 0.2% | |
| 0.2064393939 | 72 | 0.2% | |
| 0.1946969697 | 72 | 0.2% | |
| 0.2011363636 | 72 | 0.2% | |
| 0.2060606061 | 71 | 0.2% | |
| 0.2108585859 | 71 | 0.2% | |
| Other values (1423) | 30903 | 97.6% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.003156565657 | 1 | < 0.1% | |
| 0.003409090909 | 4 | < 0.1% | |
| 0.007954545455 | 2 | < 0.1% | |
| 0.008207070707 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.558459596 | 1 | < 0.1% | |
| 0.5186868687 | 1 | < 0.1% | |
| 0.5101010101 | 1 | < 0.1% | |
| 0.4953282828 | 1 | < 0.1% |
| Distinct count | 2659 |
|---|---|
| Unique (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3655919974301697 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.240497076 |
| Q1 | 0.300229741 |
| median | 0.3515037594 |
| Q3 | 0.421679198 |
| 95-th percentile | 0.532059315 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.121449457 |
Descriptive statistics
| Standard deviation | 0.09095901505 |
|---|---|
| Coefficient of variation (CV) | 0.2487992508 |
| Kurtosis | 0.4535122883 |
| Mean | 0.3655919974 |
| Median Absolute Deviation (MAD) | 0.05931495405 |
| Skewness | 0.5585924837 |
| Sum | 11570.25553 |
| Variance | 0.008273542419 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.3316624896 | 47 | 0.1% | |
| 0.337823726 | 47 | 0.1% | |
| 0.3355263158 | 42 | 0.1% | |
| 0.3279030911 | 42 | 0.1% | |
| 0.3425229741 | 41 | 0.1% | |
| 0.3367794486 | 41 | 0.1% | |
| 0.3237259816 | 41 | 0.1% | |
| 0.3264411028 | 40 | 0.1% | |
| 0.3308270677 | 40 | 0.1% | |
| 0.3430451128 | 39 | 0.1% | |
| Other values (2649) | 31228 | 98.7% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.0052213868 | 3 | < 0.1% | |
| 0.005325814536 | 1 | < 0.1% | |
| 0.007832080201 | 1 | < 0.1% | |
| 0.01075605681 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9001670844 | 1 | < 0.1% | |
| 0.8182957393 | 1 | < 0.1% | |
| 0.8167293233 | 1 | < 0.1% | |
| 0.8127610693 | 1 | < 0.1% |
| Distinct count | 5228 |
|---|---|
| Unique (%) | 16.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3677107411271664 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.9999999999999999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.231997344 |
| Q1 | 0.2951925631 |
| median | 0.3500664011 |
| Q3 | 0.4252324037 |
| 95-th percentile | 0.5621912351 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1300398406 |
Descriptive statistics
| Standard deviation | 0.1017784116 |
|---|---|
| Coefficient of variation (CV) | 0.2767893352 |
| Kurtosis | 0.596195049 |
| Mean | 0.3677107411 |
| Median Absolute Deviation (MAD) | 0.06332005312 |
| Skewness | 0.7219371765 |
| Sum | 11637.30954 |
| Variance | 0.01035884507 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.3358831341 | 28 | 0.1% | |
| 0.3220717131 | 27 | 0.1% | |
| 0.3330677291 | 25 | 0.1% | |
| 0.3297211155 | 25 | 0.1% | |
| 0.3213811421 | 24 | 0.1% | |
| 0.3248339973 | 24 | 0.1% | |
| 0.3191500664 | 23 | 0.1% | |
| 0.3315803453 | 23 | 0.1% | |
| 0.3360956175 | 23 | 0.1% | |
| 0.3373173971 | 23 | 0.1% | |
| Other values (5218) | 31403 | 99.2% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.002549800797 | 1 | < 0.1% | |
| 0.002656042497 | 1 | < 0.1% | |
| 0.004674634794 | 1 | < 0.1% | |
| 0.00823373174 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9330677291 | 1 | < 0.1% | |
| 0.8702257636 | 1 | < 0.1% | |
| 0.8500398406 | 1 | < 0.1% | |
| 0.8152988048 | 1 | < 0.1% |
| Distinct count | 5489 |
|---|---|
| Unique (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.36841336781468886 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.2322140556 |
| Q1 | 0.295610149 |
| median | 0.3506343133 |
| Q3 | 0.4261478051 |
| 95-th percentile | 0.5643198752 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1305376561 |
Descriptive statistics
| Standard deviation | 0.1022286079 |
|---|---|
| Coefficient of variation (CV) | 0.277483438 |
| Kurtosis | 0.6026377855 |
| Mean | 0.3684133678 |
| Median Absolute Deviation (MAD) | 0.06322996375 |
| Skewness | 0.7274735953 |
| Sum | 11659.54626 |
| Variance | 0.01045068828 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.3219391865 | 27 | 0.1% | |
| 0.3321083367 | 26 | 0.1% | |
| 0.3191703584 | 24 | 0.1% | |
| 0.3285843737 | 23 | 0.1% | |
| 0.3411699557 | 23 | 0.1% | |
| 0.3254128071 | 23 | 0.1% | |
| 0.3215364478 | 22 | 0.1% | |
| 0.3304470399 | 22 | 0.1% | |
| 0.308195731 | 22 | 0.1% | |
| 0.3506343133 | 21 | 0.1% | |
| Other values (5479) | 31415 | 99.3% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.002869512686 | 2 | < 0.1% | |
| 0.005487313733 | 1 | < 0.1% | |
| 0.008507853403 | 2 | < 0.1% | |
| 0.01092428514 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9265002014 | 1 | < 0.1% | |
| 0.8720298027 | 1 | < 0.1% | |
| 0.8517418445 | 1 | < 0.1% | |
| 0.8216874748 | 1 | < 0.1% |
| Distinct count | 6556 |
|---|---|
| Unique (%) | 20.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3609134973026622 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.2233368966 |
| Q1 | 0.2862840227 |
| median | 0.3414604062 |
| Q3 | 0.4189622564 |
| 95-th percentile | 0.5666474796 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1326782337 |
Descriptive statistics
| Standard deviation | 0.1051387952 |
|---|---|
| Coefficient of variation (CV) | 0.2913130043 |
| Kurtosis | 0.6657078364 |
| Mean | 0.3609134973 |
| Median Absolute Deviation (MAD) | 0.06405723214 |
| Skewness | 0.7952161924 |
| Sum | 11422.19036 |
| Variance | 0.01105416625 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.3186415591 | 25 | 0.1% | |
| 0.3468875915 | 21 | 0.1% | |
| 0.2964394375 | 21 | 0.1% | |
| 0.3176959132 | 21 | 0.1% | |
| 0.3215196119 | 21 | 0.1% | |
| 0.3583175726 | 21 | 0.1% | |
| 0.2977551188 | 20 | 0.1% | |
| 0.3101307458 | 20 | 0.1% | |
| 0.30926733 | 20 | 0.1% | |
| 0.3267412219 | 19 | 0.1% | |
| Other values (6546) | 31439 | 99.3% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.002343557273 | 2 | < 0.1% | |
| 0.00678398158 | 1 | < 0.1% | |
| 0.007688512458 | 2 | < 0.1% | |
| 0.01130663597 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9705205164 | 1 | < 0.1% | |
| 0.9023517803 | 1 | < 0.1% | |
| 0.8769015706 | 1 | < 0.1% | |
| 0.8419537867 | 1 | < 0.1% |
| Distinct count | 7926 |
|---|---|
| Unique (%) | 25.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.33583957374459794 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.2029303285 |
| Q1 | 0.2627606039 |
| median | 0.3158503423 |
| Q3 | 0.3930547307 |
| 95-th percentile | 0.5379708061 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1302941268 |
Descriptive statistics
| Standard deviation | 0.1028032319 |
|---|---|
| Coefficient of variation (CV) | 0.3061081539 |
| Kurtosis | 0.6447323519 |
| Mean | 0.3358395737 |
| Median Absolute Deviation (MAD) | 0.06226362017 |
| Skewness | 0.8279680976 |
| Sum | 10628.65083 |
| Variance | 0.0105685045 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.280905198 | 19 | 0.1% | |
| 0.2994405026 | 18 | 0.1% | |
| 0.2796549245 | 17 | 0.1% | |
| 0.343200075 | 17 | 0.1% | |
| 0.3085674991 | 17 | 0.1% | |
| 0.3118494671 | 16 | 0.1% | |
| 0.3042853124 | 16 | 0.1% | |
| 0.2863126309 | 16 | 0.1% | |
| 0.2507736067 | 16 | 0.1% | |
| 0.2835620292 | 16 | 0.1% | |
| Other values (7916) | 31480 | 99.5% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.001812896571 | 2 | < 0.1% | |
| 0.006720220048 | 1 | < 0.1% | |
| 0.007126558935 | 1 | < 0.1% | |
| 0.008533116619 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9713062232 | 1 | < 0.1% | |
| 0.8367455381 | 1 | < 0.1% | |
| 0.8135217079 | 1 | < 0.1% | |
| 0.7912043259 | 1 | < 0.1% |
| Distinct count | 9342 |
|---|---|
| Unique (%) | 29.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3367644873239581 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1990783169 |
| Q1 | 0.2606111387 |
| median | 0.3153360739 |
| Q3 | 0.3978686078 |
| 95-th percentile | 0.5464965568 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.137257469 |
Descriptive statistics
| Standard deviation | 0.1067676473 |
|---|---|
| Coefficient of variation (CV) | 0.3170395079 |
| Kurtosis | 0.5190601842 |
| Mean | 0.3367644873 |
| Median Absolute Deviation (MAD) | 0.06504150192 |
| Skewness | 0.8164986246 |
| Sum | 10657.92249 |
| Variance | 0.01139933052 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.2853551884 | 17 | 0.1% | |
| 0.3112251578 | 17 | 0.1% | |
| 0.2770286193 | 16 | 0.1% | |
| 0.2647220549 | 16 | 0.1% | |
| 0.2956193868 | 15 | < 0.1% | |
| 0.2952266241 | 15 | < 0.1% | |
| 0.3015370113 | 15 | < 0.1% | |
| 0.2708229688 | 15 | < 0.1% | |
| 0.2624963997 | 14 | < 0.1% | |
| 0.2898064989 | 14 | < 0.1% | |
| Other values (9332) | 31494 | 99.5% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.00107355136 | 2 | < 0.1% | |
| 0.007357754445 | 1 | < 0.1% | |
| 0.008326569087 | 1 | < 0.1% | |
| 0.008928805216 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9750988453 | 1 | < 0.1% | |
| 0.8488387316 | 1 | < 0.1% | |
| 0.8199837658 | 1 | < 0.1% | |
| 0.7993506323 | 1 | < 0.1% |
| Distinct count | 10391 |
|---|---|
| Unique (%) | 32.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3518576171314184 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.206794679 |
| Q1 | 0.2718796992 |
| median | 0.3293811452 |
| Q3 | 0.4175130133 |
| 95-th percentile | 0.568189705 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1456333141 |
Descriptive statistics
| Standard deviation | 0.1112207779 |
|---|---|
| Coefficient of variation (CV) | 0.3160959789 |
| Kurtosis | 0.3720835143 |
| Mean | 0.3518576171 |
| Median Absolute Deviation (MAD) | 0.06860613071 |
| Skewness | 0.7709058553 |
| Sum | 11135.58987 |
| Variance | 0.01237006144 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.2995951417 | 16 | 0.1% | |
| 0.290503181 | 15 | < 0.1% | |
| 0.3057721226 | 15 | < 0.1% | |
| 0.300242915 | 14 | < 0.1% | |
| 0.328582996 | 13 | < 0.1% | |
| 0.2896934644 | 13 | < 0.1% | |
| 0.30264893 | 13 | < 0.1% | |
| 0.2977906304 | 13 | < 0.1% | |
| 0.2863389242 | 13 | < 0.1% | |
| 0.3305957201 | 12 | < 0.1% | |
| Other values (10381) | 31511 | 99.6% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.001388085599 | 1 | < 0.1% | |
| 0.001434355119 | 1 | < 0.1% | |
| 0.008629265471 | 1 | < 0.1% | |
| 0.008837478311 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9995835743 | 1 | < 0.1% | |
| 0.8788201272 | 1 | < 0.1% | |
| 0.8339386929 | 1 | < 0.1% | |
| 0.8263273569 | 1 | < 0.1% |
| Distinct count | 10848 |
|---|---|
| Unique (%) | 34.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.35208361312506 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0000000000000002 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.206598586 |
| Q1 | 0.2721043903 |
| median | 0.3303220738 |
| Q3 | 0.4180850135 |
| 95-th percentile | 0.5671489046 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1459806232 |
Descriptive statistics
| Standard deviation | 0.1109437003 |
|---|---|
| Coefficient of variation (CV) | 0.3151061173 |
| Kurtosis | 0.3291582404 |
| Mean | 0.3520836131 |
| Median Absolute Deviation (MAD) | 0.06914986471 |
| Skewness | 0.7517530927 |
| Sum | 11142.74219 |
| Variance | 0.01230850463 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.2573754037 | 14 | < 0.1% | |
| 0.3142183818 | 14 | < 0.1% | |
| 0.2788033517 | 14 | < 0.1% | |
| 0.3401195776 | 13 | < 0.1% | |
| 0.270096884 | 13 | < 0.1% | |
| 0.3116435367 | 13 | < 0.1% | |
| 0.3184297809 | 12 | < 0.1% | |
| 0.2747665183 | 12 | < 0.1% | |
| 0.2904774374 | 12 | < 0.1% | |
| 0.2999258095 | 12 | < 0.1% | |
| Other values (10838) | 31519 | 99.6% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.00137470542 | 2 | < 0.1% | |
| 0.008837391987 | 1 | < 0.1% | |
| 0.008946495592 | 1 | < 0.1% | |
| 0.009208344244 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.989831544 | 1 | < 0.1% | |
| 0.8710831806 | 1 | < 0.1% | |
| 0.8319586279 | 1 | < 0.1% | |
| 0.8225320765 | 1 | < 0.1% |
| Distinct count | 702 |
|---|---|
| Unique (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3520001462629808 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.9999999999999998 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.2069754145 |
| Q1 | 0.2715837621 |
| median | 0.3299028016 |
| Q3 | 0.4173813608 |
| 95-th percentile | 0.5671812464 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1457975986 |
Descriptive statistics
| Standard deviation | 0.1108735157 |
|---|---|
| Coefficient of variation (CV) | 0.3149814478 |
| Kurtosis | 0.3273589478 |
| Mean | 0.3520001463 |
| Median Absolute Deviation (MAD) | 0.06861063465 |
| Skewness | 0.7513709293 |
| Sum | 11140.10063 |
| Variance | 0.01229293648 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.303030303 | 162 | 0.5% | |
| 0.2813036021 | 150 | 0.5% | |
| 0.2973127501 | 150 | 0.5% | |
| 0.2858776444 | 149 | 0.5% | |
| 0.2801600915 | 149 | 0.5% | |
| 0.2995997713 | 149 | 0.5% | |
| 0.3339050886 | 148 | 0.5% | |
| 0.3138936535 | 147 | 0.5% | |
| 0.2710120069 | 146 | 0.5% | |
| 0.3253287593 | 145 | 0.5% | |
| Other values (692) | 30153 | 95.3% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.001143510577 | 2 | < 0.1% | |
| 0.008576329331 | 2 | < 0.1% | |
| 0.009719839909 | 2 | < 0.1% | |
| 0.01086335049 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9891366495 | 1 | < 0.1% | |
| 0.8702115495 | 1 | < 0.1% | |
| 0.8313321898 | 1 | < 0.1% | |
| 0.8227558605 | 1 | < 0.1% |
| Distinct count | 31595 |
|---|---|
| Unique (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.49586622792781354 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.9999999999999999 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.0478969621 |
| Q1 | 0.2467507047 |
| median | 0.4957250235 |
| Q3 | 0.7448872534 |
| 95-th percentile | 0.9442107736 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.4981365487 |
Descriptive statistics
| Standard deviation | 0.2875779691 |
|---|---|
| Coefficient of variation (CV) | 0.5799507062 |
| Kurtosis | -1.199877423 |
| Mean | 0.4958662279 |
| Median Absolute Deviation (MAD) | 0.249076104 |
| Skewness | 0.001798552736 |
| Sum | 15693.17438 |
| Variance | 0.08270108829 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 9.395552772e-05 | 2 | < 0.1% | |
| 0.0003758221109 | 2 | < 0.1% | |
| 0.0008142812402 | 2 | < 0.1% | |
| 0.001440651425 | 2 | < 0.1% | |
| 0.001096147823 | 2 | < 0.1% | |
| 0.0005637331663 | 2 | < 0.1% | |
| 3.131850924e-05 | 2 | < 0.1% | |
| 6.263701848e-05 | 2 | < 0.1% | |
| 0.0002192295647 | 2 | < 0.1% | |
| Other values (31585) | 31628 | 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 3.131850924e-05 | 2 | < 0.1% | |
| 6.263701848e-05 | 2 | < 0.1% | |
| 9.395552772e-05 | 2 | < 0.1% | |
| 0.000125274037 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9999686815 | 1 | < 0.1% | |
| 0.999937363 | 1 | < 0.1% | |
| 0.9998434075 | 1 | < 0.1% | |
| 0.9998120889 | 1 | < 0.1% |
age
Real number (ℝ≥0)
| Distinct count | 64 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.38705873738103264 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.9999999999999999 |
| Zeros | 33 |
| Zeros (%) | 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1111111111 |
| Q1 | 0.2380952381 |
| median | 0.380952381 |
| Q3 | 0.5079365079 |
| 95-th percentile | 0.6825396825 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.2698412698 |
Descriptive statistics
| Standard deviation | 0.1793616581 |
|---|---|
| Coefficient of variation (CV) | 0.4633964842 |
| Kurtosis | -0.56427208 |
| Mean | 0.3870587374 |
| Median Absolute Deviation (MAD) | 0.126984127 |
| Skewness | 0.1670120483 |
| Sum | 12249.63492 |
| Variance | 0.03217060439 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.4285714286 | 1191 | 3.8% | |
| 0.4444444444 | 1149 | 3.6% | |
| 0.3492063492 | 1075 | 3.4% | |
| 0.4603174603 | 1053 | 3.3% | |
| 0.380952381 | 1034 | 3.3% | |
| 0.5079365079 | 1004 | 3.2% | |
| 0.3650793651 | 976 | 3.1% | |
| 0.3968253968 | 961 | 3.0% | |
| 0.4761904762 | 904 | 2.9% | |
| 0.5238095238 | 897 | 2.8% | |
| Other values (54) | 21404 | 67.6% |
| Value | Count | Frequency (%) | |
| 0 | 33 | 0.1% | |
| 0.01587301587 | 41 | 0.1% | |
| 0.03174603175 | 111 | 0.4% | |
| 0.04761904762 | 171 | 0.5% | |
| 0.06349206349 | 275 | 0.9% |
| Value | Count | Frequency (%) | |
| 1 | 5 | < 0.1% | |
| 0.9841269841 | 3 | < 0.1% | |
| 0.9682539683 | 3 | < 0.1% | |
| 0.9523809524 | 5 | < 0.1% | |
| 0.9365079365 | 6 | < 0.1% |
gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.2 KiB |
| M | |
|---|---|
| F |
| Value | Count | Frequency (%) | |
| M | 17484 | 55.2% | |
| F | 14164 | 44.8% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
| Distinct count | 17490 |
|---|---|
| Unique (%) | 55.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4553331856406589 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1.0 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.04360703312 |
| Q1 | 0.224137931 |
| median | 0.4505519517 |
| Q3 | 0.6766814612 |
| 95-th percentile | 0.9055223626 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.4525435302 |
Descriptive statistics
| Standard deviation | 0.269335098 |
|---|---|
| Coefficient of variation (CV) | 0.59151212 |
| Kurtosis | -1.072844482 |
| Mean | 0.4553331856 |
| Median Absolute Deviation (MAD) | 0.2262717651 |
| Skewness | 0.09915551468 |
| Sum | 14410.38466 |
| Variance | 0.07254139499 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 0.0003414134517 | 4 | < 0.1% | |
| 5.690224195e-05 | 4 | < 0.1% | |
| 0.0005690224195 | 4 | < 0.1% | |
| 0.0001138044839 | 4 | < 0.1% | |
| 0.0004552179356 | 4 | < 0.1% | |
| 0.0002276089678 | 4 | < 0.1% | |
| 0.0005121201775 | 4 | < 0.1% | |
| 0.0001707067258 | 4 | < 0.1% | |
| 0.0002845112097 | 4 | < 0.1% | |
| Other values (17480) | 31608 | 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 4 | < 0.1% | |
| 5.690224195e-05 | 4 | < 0.1% | |
| 0.0001138044839 | 4 | < 0.1% | |
| 0.0001707067258 | 4 | < 0.1% | |
| 0.0002276089678 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9999430978 | 1 | < 0.1% | |
| 0.9998861955 | 1 | < 0.1% | |
| 0.999772391 | 1 | < 0.1% | |
| 0.9992602709 | 1 | < 0.1% |
division
Real number (ℝ≥0)
| Distinct count | 6921 |
|---|---|
| Unique (%) | 21.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.27540847901001014 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.9999999999999999 |
| Zeros | 23 |
| Zeros (%) | 0.1% |
| Memory size | 247.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.01390083118 |
| Q1 | 0.08684436801 |
| median | 0.2030667813 |
| Q3 | 0.3725995987 |
| 95-th percentile | 0.804335053 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.2857552307 |
Descriptive statistics
| Standard deviation | 0.244317063 |
|---|---|
| Coefficient of variation (CV) | 0.8871079926 |
| Kurtosis | 0.3354066435 |
| Mean | 0.275408479 |
| Median Absolute Deviation (MAD) | 0.1314130123 |
| Skewness | 1.113716829 |
| Sum | 8716.127544 |
| Variance | 0.05969082726 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 23 | 0.1% | |
| 0.000286615076 | 22 | 0.1% | |
| 0.000143307538 | 22 | 0.1% | |
| 0.0005732301519 | 20 | 0.1% | |
| 0.0008598452279 | 20 | 0.1% | |
| 0.0004299226139 | 20 | 0.1% | |
| 0.001003152766 | 20 | 0.1% | |
| 0.00143307538 | 19 | 0.1% | |
| 0.001289767842 | 19 | 0.1% | |
| 0.0007165376899 | 19 | 0.1% | |
| Other values (6911) | 31444 | 99.4% |
| Value | Count | Frequency (%) | |
| 0 | 23 | 0.1% | |
| 0.000143307538 | 22 | 0.1% | |
| 0.000286615076 | 22 | 0.1% | |
| 0.0004299226139 | 20 | 0.1% | |
| 0.0005732301519 | 20 | 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9997133849 | 1 | < 0.1% | |
| 0.9985669246 | 1 | < 0.1% | |
| 0.9984236171 | 1 | < 0.1% | |
| 0.9975637719 | 1 | < 0.1% |
| Distinct count | 78 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.2 KiB |
| USA | |
|---|---|
| CAN | 2164 |
| GBR | 341 |
| ITA | 209 |
| MEX | 202 |
| Other values (73) | 1793 |
| Value | Count | Frequency (%) | |
| USA | 26939 | 85.1% | |
| CAN | 2164 | 6.8% | |
| GBR | 341 | 1.1% | |
| ITA | 209 | 0.7% | |
| MEX | 202 | 0.6% | |
| GER | 180 | 0.6% | |
| JPN | 172 | 0.5% | |
| AUS | 123 | 0.4% | |
| IRL | 116 | 0.4% | |
| FRA | 113 | 0.4% | |
| Other values (68) | 1089 | 3.4% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
| Distinct count | 69 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.2 KiB |
| MA | |
|---|---|
| others | 2545 |
| CA | 2302 |
| NY | 1537 |
| ON | 1045 |
| Other values (64) |
| Value | Count | Frequency (%) | |
| MA | 7427 | 23.5% | |
| others | 2545 | 8.0% | |
| CA | 2302 | 7.3% | |
| NY | 1537 | 4.9% | |
| ON | 1045 | 3.3% | |
| PA | 997 | 3.2% | |
| TX | 988 | 3.1% | |
| IL | 911 | 2.9% | |
| OH | 754 | 2.4% | |
| FL | 745 | 2.4% | |
| Other values (59) | 12397 | 39.2% |
Length
| Max length | 6 |
|---|---|
| Median length | 2 |
| Mean length | 2.321663296 |
| Min length | 2 |
| Distinct count | 5905 |
|---|---|
| Unique (%) | 18.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.2 KiB |
| Boston | 1018 |
|---|---|
| New York | 497 |
| Chicago | 312 |
| Cambridge | 306 |
| Toronto | 239 |
| Other values (5900) |
| Value | Count | Frequency (%) | |
| Boston | 1018 | 3.2% | |
| New York | 497 | 1.6% | |
| Chicago | 312 | 1.0% | |
| Cambridge | 306 | 1.0% | |
| Toronto | 239 | 0.8% | |
| Somerville | 239 | 0.8% | |
| Brookline | 219 | 0.7% | |
| Washington | 210 | 0.7% | |
| Newton | 195 | 0.6% | |
| San Francisco | 192 | 0.6% | |
| Other values (5895) | 28221 | 89.2% |
Length
| Max length | 35 |
|---|---|
| Median length | 8 |
| Mean length | 8.799892568 |
| Min length | 2 |
| Distinct count | 31648 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.2 KiB |
| 26385 | 1 |
|---|---|
| 12607 | 1 |
| 16839 | 1 |
| 8995 | 1 |
| 12146 | 1 |
| Other values (31643) |
| Value | Count | Frequency (%) | |
| 26385 | 1 | < 0.1% | |
| 12607 | 1 | < 0.1% | |
| 16839 | 1 | < 0.1% | |
| 8995 | 1 | < 0.1% | |
| 12146 | 1 | < 0.1% | |
| 20518 | 1 | < 0.1% | |
| 20205 | 1 | < 0.1% | |
| 27315 | 1 | < 0.1% | |
| 28524 | 1 | < 0.1% | |
| 27608 | 1 | < 0.1% | |
| Other values (31638) | 31638 | > 99.9% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.693914307 |
| Min length | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| 5k | 10k | 20k | half | 25k | 30k | 35k | 40k | official | pace | overall | age | gender | genderdiv | division | country | state | city | bib | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.003409 | 0.007832 | 0.008234 | 0.008508 | 0.007689 | 0.008533 | 0.010421 | 0.010017 | 0.010147 | 0.010863 | 0.000219 | 0.460317 | M | 0.000398 | 0.001003 | JPN | others | Fukuoka | W1 |
| 1 | 0.106944 | 0.166667 | 0.157928 | 0.158276 | 0.150604 | 0.134029 | 0.128172 | 0.129555 | 0.127324 | 0.126930 | 0.000626 | 0.238095 | F | 0.000000 | 0.000000 | KEN | others | Eldoret | F1 |
| 2 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.365079 | M | 0.000000 | 0.000000 | RSA | others | Paarl | W2 |
| 3 | 0.106692 | 0.166562 | 0.157928 | 0.158276 | 0.150604 | 0.134029 | 0.128172 | 0.131521 | 0.130881 | 0.130932 | 0.000814 | 0.095238 | F | 0.000114 | 0.000287 | ETH | others | Shoa | F2 |
| 4 | 0.003409 | 0.005221 | 0.002550 | 0.002870 | 0.002344 | 0.001813 | 0.001074 | 0.001434 | 0.001375 | 0.001144 | 0.000031 | 0.349206 | M | 0.000057 | 0.000143 | JPN | others | Nogata Fukuoka | W3 |
| 5 | 0.106944 | 0.166667 | 0.157928 | 0.158276 | 0.150687 | 0.134029 | 0.128172 | 0.131521 | 0.131099 | 0.130932 | 0.000846 | 0.174603 | F | 0.000171 | 0.000430 | KEN | others | Nandi | F3 |
| 6 | 0.007955 | 0.010756 | 0.008234 | 0.008508 | 0.007689 | 0.006720 | 0.007358 | 0.008629 | 0.008837 | 0.008576 | 0.000094 | 0.158730 | M | 0.000171 | 0.000430 | SUI | others | Neuenkirch | W4 |
| 7 | 0.093687 | 0.144737 | 0.135564 | 0.135018 | 0.128073 | 0.112743 | 0.104868 | 0.108039 | 0.107423 | 0.108062 | 0.000125 | 0.174603 | M | 0.000228 | 0.000573 | ETH | others | Addis Ababa | 5 |
| 8 | 0.003157 | 0.005221 | 0.002656 | 0.002870 | 0.002344 | 0.001813 | 0.001074 | 0.001388 | 0.001375 | 0.001144 | 0.000063 | 0.396825 | M | 0.000114 | 0.000287 | JPN | others | Isahaya | W6 |
| 9 | 0.093434 | 0.144737 | 0.136414 | 0.136931 | 0.131075 | 0.119807 | 0.117436 | 0.124303 | 0.124880 | 0.125214 | 0.000595 | 0.206349 | M | 0.001081 | 0.002723 | USA | CA | Redding | 6 |
Last rows
| 5k | 10k | 20k | half | 25k | 30k | 35k | 40k | official | pace | overall | age | gender | genderdiv | division | country | state | city | bib | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 31638 | 0.316288 | 0.538429 | 0.583373 | 0.586337 | 0.586013 | 0.540368 | 0.533319 | 0.552527 | 0.548202 | 0.548313 | 0.928187 | 0.206349 | M | 0.941163 | 0.802952 | USA | MA | Dorchester | 35901 |
| 31639 | 0.332955 | 0.552527 | 0.576627 | 0.578886 | 0.595140 | 0.566468 | 0.578356 | 0.602707 | 0.601379 | 0.601487 | 0.965424 | 0.603175 | M | 0.972516 | 0.247062 | USA | MA | Reading | 35902 |
| 31640 | 0.301136 | 0.494256 | 0.490465 | 0.490687 | 0.476729 | 0.450098 | 0.426226 | 0.460197 | 0.454417 | 0.454545 | 0.813467 | 0.238095 | M | 0.838113 | 0.715391 | USA | MA | Hyde Park | 35905 |
| 31641 | 0.278662 | 0.487782 | 0.501195 | 0.498137 | 0.491736 | 0.475354 | 0.469299 | 0.487033 | 0.485140 | 0.485420 | 0.860915 | 0.301587 | M | 0.880335 | 0.747492 | USA | MA | Boston | 35906 |
| 31642 | 0.370581 | 0.627924 | 0.678247 | 0.681736 | 0.675397 | 0.628731 | 0.623576 | 0.638704 | 0.640700 | 0.640366 | 0.980113 | 0.412698 | M | 0.983669 | 0.372170 | USA | MA | Wayland | 35907 |
| 31643 | 0.232071 | 0.356099 | 0.337052 | 0.336035 | 0.321602 | 0.288188 | 0.281428 | 0.287149 | 0.286179 | 0.285878 | 0.308425 | 0.222222 | M | 0.426710 | 0.489109 | USA | CA | Larkspur | 35908 |
| 31644 | 0.294444 | 0.466374 | 0.490146 | 0.492449 | 0.490174 | 0.459413 | 0.464193 | 0.484372 | 0.484245 | 0.484277 | 0.859850 | 0.253968 | M | 0.879595 | 0.746776 | USA | MA | Norwell | 35909 |
| 31645 | 0.257955 | 0.442565 | 0.463373 | 0.465868 | 0.457281 | 0.425718 | 0.424707 | 0.440023 | 0.439469 | 0.439680 | 0.785813 | 0.047619 | F | 0.613804 | 0.749498 | USA | CT | West Simsbury | 35910 |
| 31646 | 0.293308 | 0.492168 | 0.498274 | 0.498389 | 0.501686 | 0.472197 | 0.470687 | 0.485298 | 0.484704 | 0.484277 | 0.860476 | 0.317460 | F | 0.683282 | 0.831040 | USA | MA | North Andover | 35911 |
| 31647 | 0.242045 | 0.386487 | 0.382098 | 0.383105 | 0.366828 | 0.329385 | 0.320128 | 0.324766 | 0.322292 | 0.322470 | 0.464861 | 0.571429 | M | 0.555935 | 0.199914 | USA | PA | Lancaster | 35912 |